61 research outputs found

    Cytotaxonomic characterization and estimation of migration patterns of onchocerciasis vectors (Simulium damnosum sensu lato) in northwestern Ethiopia based on RADSeq data

    Get PDF
    While much progress has been made in the control and elimination of onchocerciasis across Africa, the extent to which vector migration might confound progress towards elimination or result in re-establishment of endemism in areas where transmission has been eliminated remains unclear. In Northern Ethiopia, Metema and Metekel-two foci located near the Sudan border-exhibit continuing transmission. While progress towards elimination has been faster in Metema, there remains a problematic hotspot of transmission. Whether migration from Metekel contributes to this is currently unknown. To assess the role of vector migration from Metekel into Metema, we present a population genomics study of 151 adult female vectors using 47,638 RADseq markers and mtDNA CoI sequencing. From additional cytotaxonomy data we identified a new cytoform in Metema, closely related to S. damnosum s.str, here called the Gondar form. RADseq data strongly indicate the existence of two distinctly differentiated clusters within S. damnosum s.l.: one genotypic cluster found only in Metema, and the second found predominantly in Metekel. Because blackflies from both clusters were found in sympatry (in all four collection sites in Metema), but hybrid genotypes were not detected, there may be reproductive barriers preventing interbreeding. The dominant genotype in Metema was not found in Metekel while the dominant genotype in Metekel was found in Metema, indicating that (at the time of sampling) migration is primarily unidirectional, with flies moving from Metekel to Metema. There was strong differentiation between clusters but little genetic differentiation within clusters, suggesting migration and gene flow of flies within the same genetic cluster are sufficient to prevent genetic divergence between sites. Our results confirm that Metekel and Metema represent different transmission foci, but also indicate a northward movement of vectors between foci that may have epidemiological importance, although its significance requires further study

    On the PATHGROUPS approach to rapid small phylogeny

    Get PDF
    We present a data structure enabling rapid heuristic solution to the ancestral genome reconstruction problem for given phylogenies under genomic rearrangement metrics. The efficiency of the greedy algorithm is due to fast updating of the structure during run time and a simple priority scheme for choosing the next step. Since accuracy deteriorates for sets of highly divergent genomes, we investigate strategies for improving accuracy and expanding the range of data sets where accurate reconstructions can be expected. This includes a more refined priority system, and a two-step look-ahead, as well as iterative local improvements based on a the median version of the problem, incorporating simulated annealing. We apply this to a set of yeast genomes to corroborate a recent gene sequence-based phylogeny

    Phylogenetic representativeness: a new method for evaluating taxon sampling in evolutionary studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Taxon sampling is a major concern in phylogenetic studies. Incomplete, biased, or improper taxon sampling can lead to misleading results in reconstructing evolutionary relationships. Several theoretical methods are available to optimize taxon choice in phylogenetic analyses. However, most involve some knowledge about the genetic relationships of the group of interest (i.e., the ingroup), or even a well-established phylogeny itself; these data are not always available in general phylogenetic applications.</p> <p>Results</p> <p>We propose a new method to assess taxon sampling developing Clarke and Warwick statistics. This method aims to measure the "phylogenetic representativeness" of a given sample or set of samples and it is based entirely on the pre-existing available taxonomy of the ingroup, which is commonly known to investigators. Moreover, our method also accounts for instability and discordance in taxonomies. A Python-based script suite, called PhyRe, has been developed to implement all analyses we describe in this paper.</p> <p>Conclusions</p> <p>We show that this method is sensitive and allows direct discrimination between representative and unrepresentative samples. It is also informative about the addition of taxa to improve taxonomic coverage of the ingroup. Provided that the investigators' expertise is mandatory in this field, phylogenetic representativeness makes up an objective touchstone in planning phylogenetic studies.</p

    Systematic discovery of unannotated genes in 11 yeast species using a database of orthologous genomic segments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In standard BLAST searches, no information other than the sequences of the query and the database entries is considered. However, in situations where two genes from different species have only borderline similarity in a BLAST search, the discovery that the genes are located within a region of conserved gene order (synteny) can provide additional evidence that they are orthologs. Thus, for interpreting borderline search results, it would be useful to know whether the syntenic context of a database hit is similar to that of the query. This principle has often been used in investigations of particular genes or genomic regions, but to our knowledge it has never been implemented systematically.</p> <p>Results</p> <p>We made use of the synteny information contained in the Yeast Gene Order Browser database for 11 yeast species to carry out a systematic search for protein-coding genes that were overlooked in the original annotations of one or more yeast genomes but which are syntenic with their orthologs. Such genes tend to have been overlooked because they are short, highly divergent, or contain introns. The key features of our software - called SearchDOGS - are that the database entries are classified into sets of genomic segments that are already known to be orthologous, and that very weak BLAST hits are retained for further analysis if their genomic location is similar to that of the query. Using SearchDOGS we identified 595 additional protein-coding genes among the 11 yeast species, including two new genes in <it>Saccharomyces cerevisiae</it>. We found additional genes for the mating pheromone a-factor in six species including <it>Kluyveromyces lactis</it>.</p> <p>Conclusions</p> <p>SearchDOGS has proven highly successful for identifying overlooked genes in the yeast genomes. We anticipate that our approach can be adapted for study of further groups of species, such as bacterial genomes. More generally, the concept of doing sequence similarity searches against databases to which external information has been added may prove useful in other settings.</p

    Assessing the Value of DNA Barcodes for Molecular Phylogenetics: Effect of Increased Taxon Sampling in Lepidoptera

    Get PDF
    BACKGROUND: A common perception is that DNA barcode datamatrices have limited phylogenetic signal due to the small number of characters available per taxon. However, another school of thought suggests that the massively increased taxon sampling afforded through the use of DNA barcodes may considerably increase the phylogenetic signal present in a datamatrix. Here I test this hypothesis using a large dataset of macrolepidopteran DNA barcodes. METHODOLOGY/PRINCIPAL FINDINGS: Taxon sampling was systematically increased in datamatrices containing macrolepidopteran DNA barcodes. Sixteen family groups were designated as concordance groups and two quantitative measures; the taxon consistency index and the taxon retention index, were used to assess any changes in phylogenetic signal as a result of the increase in taxon sampling. DNA barcodes alone, even with maximal taxon sampling (500 species per family), were not sufficient to reconstruct monophyly of families and increased taxon sampling generally increased the number of clades formed per family. However, the scores indicated a similar level of taxon retention (species from a family clustering together) in the cladograms as the number of species included in the datamatrix was increased, suggesting substantial phylogenetic signal below the 'family' branch. CONCLUSIONS/SIGNIFICANCE: The development of supermatrix, supertree or constrained tree approaches could enable the exploitation of the massive taxon sampling afforded through DNA barcodes for phylogenetics, connecting the twigs resolved by barcodes to the deep branches resolved through phylogenomics

    Distinct Origin of the Y and St Genome in Elymus Species: Evidence from the Analysis of a Large Sample of St Genome Species Using Two Nuclear Genes

    Get PDF
    Previous cytological and single copy nuclear genes data suggested the St and Y genome in the StY-genomic Elymus species originated from different donors: the St from a diploid species in Pseudoroegneria and the Y from an unknown diploid species, which are now extinct or undiscovered. However, ITS data suggested that the Y and St genome shared the same progenitor although rather few St genome species were studied. In a recent analysis of many samples of St genome species Pseudoroegneria spicata (Pursh) À. Löve suggested that one accession of P. spicata species was the most likely donor of the Y genome. The present study tested whether intraspecific variation during sampling could affect the outcome of analyses to determining the origin of Y genome in allotetraploid StY species. We also explored the evolutionary dynamics of these species.Two single copy nuclear genes, the second largest subunit of RNA polymerase II (RPB2) and the translation elongation factor G (EF-G) sequences from 58 accessions of Pseudoroegneria and Elymus species, together with those from Hordeum (H), Agropyron (P), Australopyrum (W), Lophopyrum (E(e)), Thinopyrum (E(a)), Thinopyrum (E(b)), and Dasypyrum (V) were analyzed using maximum parsimony, maximum likelihood and Bayesian methods. Sequence comparisons among all these genomes revealed that the St and Y genomes are relatively dissimilar. Extensive sequence variations have been detected not only between the sequences from St and Y genome, but also among the sequences from diploid St genome species. Phylogenetic analyses separated the Y sequences from the St sequences.Our results confirmed that St and Y genome in Elymus species have originated from different donors, and demonstrated that intraspecific variation does not affect the identification of genome origin in polyploids. Moreover, sequence data showed evidence to support the suggestion of the genome convergent evolution in allopolyploid StY genome species

    Diverse Forms of RPS9 Splicing Are Part of an Evolving Autoregulatory Circuit

    Get PDF
    Ribosomal proteins are essential to life. While the functions of ribosomal protein-encoding genes (RPGs) are highly conserved, the evolution of their regulatory mechanisms is remarkably dynamic. In Saccharomyces cerevisiae, RPGs are unusual in that they are commonly present as two highly similar gene copies and in that they are over-represented among intron-containing genes. To investigate the role of introns in the regulation of RPG expression, we constructed 16 S. cerevisiae strains with precise deletions of RPG introns. We found that several yeast introns function to repress rather than to increase steady-state mRNA levels. Among these, the RPS9A and RPS9B introns were required for cross-regulation of the two paralogous gene copies, which is consistent with the duplication of an autoregulatory circuit. To test for similar intron function in animals, we performed an experimental test and comparative analyses for autoregulation among distantly related animal RPS9 orthologs. Overexpression of an exogenous RpS9 copy in Drosophila melanogaster S2 cells induced alternative splicing and degradation of the endogenous copy by nonsense-mediated decay (NMD). Also, analysis of expressed sequence tag data from distantly related animals, including Homo sapiens and Ciona intestinalis, revealed diverse alternatively-spliced RPS9 isoforms predicted to elicit NMD. We propose that multiple forms of splicing regulation among RPS9 orthologs from various eukaryotes operate analogously to translational repression of the alpha operon by S4, the distant prokaryotic ortholog. Thus, RPS9 orthologs appear to have independently evolved variations on a fundamental autoregulatory circuit

    A Complete Sequence and Transcriptomic Analyses of Date Palm (Phoenix dactylifera L.) Mitochondrial Genome

    Get PDF
    Based on next-generation sequencing data, we assembled the mitochondrial (mt) genome of date palm (Phoenix dactylifera L.) into a circular molecule of 715,001 bp in length. The mt genome of P. dactylifera encodes 38 proteins, 30 tRNAs, and 3 ribosomal RNAs, which constitute a gene content of 6.5% (46,770 bp) over the full length. The rest, 93.5% of the genome sequence, is comprised of cp (chloroplast)-derived (10.3% with respect to the whole genome length) and non-coding sequences. In the non-coding regions, there are 0.33% tandem and 2.3% long repeats. Our transcriptomic data from eight tissues (root, seed, bud, fruit, green leaf, yellow leaf, female flower, and male flower) showed higher gene expression levels in male flower, root, bud, and female flower, as compared to four other tissues. We identified 120 potential SNPs among three date palm cultivars (Khalas, Fahal, and Sukry), and successfully found seven SNPs in the coding sequences. A phylogenetic analysis, based on 22 conserved genes of 15 representative plant mitochondria, showed that P. dactylifera positions at the root of all sequenced monocot mt genomes. In addition, consistent with previous discoveries, there are three co-transcribed gene clusters–18S-5S rRNA, rps3-rpl16 and nad3-rps12–in P. dactylifera, which are highly conserved among all known mitochondrial genomes of angiosperms

    The apicomplexan plastid and its evolution

    Get PDF
    Protistan species belonging to the phylum Apicomplexa have a non-photosynthetic secondary plastid—the apicoplast. Although its tiny genome and even the entire nuclear genome has been sequenced for several organisms bearing the organelle, the reason for its existence remains largely obscure. Some of the functions of the apicoplast, including housekeeping ones, are significantly different from those of other plastids, possibly due to the organelle’s unique symbiotic origin
    corecore